3574 results found.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
63.14 MByte Production Status:
Existing-used
Use:
Document Classification, Text categorisation
-
Paper title:RANCC: Rationalizing Neural Networks via Concept Clustering
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Housam Khalifa Bashier | IMDB movie reviews | /N |
Documentation:
None
Not Applicable
Ontology,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
None Production Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:Expert Concept-Modeling Ground Truth Construction for Word Embeddings Evaluation in Concept-Focused Domains
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jelke Bloem | QuiNE-GT | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Not Available
License:
Size:
2,150,356 words Production Status:
Newly created-in progress
Use:
Text Mining
-
Paper title:Expert Concept-Modeling Ground Truth Construction for Word Embeddings Evaluation in Concept-Focused Domains
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jelke Bloem | QUINE corpus | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
50 MByte Production Status:
Existing-used
Use:
Summarisation
-
Paper title:Metrics also Disagree in the Low Scoring Range: Revisiting Summarization Evaluation Metrics
-
Paper track:Short paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Manik Bhandari | CNN/Dailymail dataset | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
7.1 MByte Production Status:
Existing-used
Use:
Summarisation
-
Paper title:Metrics also Disagree in the Low Scoring Range: Revisiting Summarization Evaluation Metrics
-
Paper track:Short paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Manik Bhandari | Text Analysis Conference | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English German
Availability:
Freely Available
License:
Creative Commons
Size:
4.5M sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Dynamic Curriculum Learning for Low-Resource Neural Machine Translation
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Chen Xu | WMT 2016 English-German | /N |
Documentation:
N/A
Written
Corpus,
Language Type:
Bilingual
Languages:
English Japanese
Availability:
Freely Available
License:
Creative Commons Attribution-NonCommercial-NoDerivs 3.0
Size:
226834 sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Dynamic Curriculum Learning for Low-Resource Neural Machine Translation
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Chen Xu | IWSLT 2017 English-Japanese | /N |
Documentation:
IWSLT 2017 evaluation campaign: test sets for the MT Talk task
Written
Corpus,
Language Type:
Bilingual
Languages:
Chinese English
Availability:
Freely Available
License:
Creative Commons Attribution-NonCommercial-NoDerivs 3.0
Size:
213377 sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Dynamic Curriculum Learning for Low-Resource Neural Machine Translation
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Chen Xu | IWSLT 2015 English-Chinese | /N |
Documentation:
IWSLT 2015 evaluation campaign: training/development data
Written
Corpus,
Language Type:
Bilingual
Languages:
English Thai
Availability:
Freely Available
License:
Creative Commons Attribution-NonCommercial-NoDerivs 3.0
Size:
85019 sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Dynamic Curriculum Learning for Low-Resource Neural Machine Translation
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Chen Xu | IWSLT 2015 English-Thai | /N |
Documentation:
IWSLT 2015 evaluation campaign: training/development data
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
1.8 million articles OtherProduction Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Disha Jindal | The New York Times Annotated Corpus | /N |
Documentation:
None




